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COMPOSITIONS AND METHODS FOR DETERMINING 
THE REPLICATION CAPACITY OF A PATHOGENIC VIRUS 

This application is entitled to and claims priority to U.S. Provisional Application 
No. 60/393,306, filed July 1, 2002, the contents of which is hereby incorporated by reference 
in its entirety. 

5 1. FIELD OF INVENTION 

This invention relates to compositions and methods for determining the replication 
capacity of a virus. The compositions and methods are useful for identifying effective drug 
regimens for the treatment of viral infections, and identifying and determining the biological 
effectiveness of potential therapeutic compounds. 

10 2. BACKGROUND OF THE INVENTION 

More than 60 million people have been infected with the human immunodeficiency 
virus ("HIV"), the causative agent of acquired immune deficiency syndrome ("AIDS"), since 
the early 1980s. See Lucas, 2002, Lepr Rev. 73(1):64-71. HIV/AIDS is now the leading 
cause of death in sub-Saharan Africa, and is the fourth biggest killer worldwide. At the end 
15 of 200 1 , an estimated 40 million people were living with HIV globally. See Norris, 2002, 
Radiol Technol. 73(4):339-363. 

Modern anti-HIV drugs target different stages of the HIV life cycle and a variety of 
enzymes essential for HIV's replication and/or survival. Amongst the drugs that have so far 
been approved for AIDS therapy are nucleoside reverse transcriptase inhibitors ("NRTI") 
20 such as AZT, ddl, ddC, d4T, 3TC, abacavir, nucleotide reverse transcriptase inhibitors such 
as tenofovir, non-nucleoside reverse transcriptase inhibitors ("NNRTI") such as nevirapine, 
efavirenz, delavirdine and protease inhibitors such as saquinavir, ritonavir, indinavir, 
nelfinavir, amprenavir and lopinavir. 

One consequence of the action of an anti-viral drug is that it can exert sufficient 
25 selective pressure on virus replication to select for drug-resistant mutants (Herrmann et al. 9 
1977 \ Ann NY Acad Sci 284:632-637). With increasing drug exposure, the selective pressure 
on the replicating virus population increases to promote the more rapid emergence of drug 
resistant mutants. Many protease inhibitor resistance mutations and some NRTI resistance 
mutations are known to impair HIV-1 replication capacity to varying degrees. Typically, 
30 mutations conferring resistance to an antiviral drug reduce the replication capacity of the 
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mutant virus. See, e.g., Nijhuis et al, 2001, Curr Op Infect Diseases 14:23-28, incorporated 
herein by reference in its entirety. Changes in replication capacity of a virus are of major 
clinical importance because they can affect the response of a patient to anti-viral therapies. 
See id. However, the effects of NNRTI resistance mutations are largely uncharacterized and 
5 it is often assumed that NNRTI resistance is not associated with impaired viral replication. 
Thus, there is a need in the art for methods and compositions for determining the replication 
capacity of a NNRTI resistant virus. 

3. SUMMARY OF THE INVENTION 

The present invention provides methods and compositions for determining the 
10 replication capacity (also called the replication fitness) of a virus, for example, HIV, e.g., a 
non-nucleoside reverse transcriptase inhibitor (NNRTI) resistant HIV. The methods and 
compositions are based on an analysis of a panel of recombinant virus vectors created using 
site-directed mutagenesis containing one or more reverse transcriptase (RT) amino acid 
substitutions. The methods and compositions of the invention significantly improve the 
1 5 quality of life of a patient by providing information to the clinician useful for the design of 
more effective anti-viral treatment regimens. Also, by avoiding the administration of 
ineffective drugs, considerable time and money is saved. 

The methods for measuring replication fitness can be adapted to other viruses, 
including, but not limited to hepadnaviruses {e.g., human hepatitis B virus), flaviviruses {e.g., 
20 human hepatitis C virus) and herpesviruses {e.g., human cytomegalovirus). 

This invention further relates to a method for measuring the replication fitness of 
HIV-1 that exhibits reduced drug susceptibility to reverse transcriptase inhibitors and 
protease inhibitors. The methods for measuring replication fitness can be adapted to other 
classes of inhibitors of HIV-1 replication, including, but not limited to, integration, virus 
25 assembly, and virus attachment and entry. 

The invention further relates to a method for identifying mutations in protease or 
reverse transcriptase that alter replication fitness. 

The methods for identifying mutations that alter replication fitness described herein 
can be adapted to other components of HIV-1 replication, including, but not limited to, 
30 integration, virus assembly, and virus attachment and entry. 
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The present invention further relates to methods for quantifying the effect that specific 
mutations in protease or reverse transcriptase have on replication fitness. The methods for 
quantifying the affect that specific protease and reverse transcriptase mutations have on 
replication fitness can be adapted to mutations in other viral genes involved in HIV-1 
5 replication, including, but not limited to the gag, pol, and envelope genes. 

This invention further relates to the high incidence of patient samples with reduced 
replication fitness and the general correlation between reduced drug susceptibility and 
reduced replication fitness. More specifically, the present invention further relates to the 
occurrence of viruses with reduced fitness in patients receiving protease inhibitor and/or 
1 0 reverse transcriptase inhibitor treatment. 

The invention further relates to the incidence of patient samples with reduced 
replication fitness in which the reduction in fitness is due to altered protease processing of the 
gag polyprotein (p55). 

The invention further relates to the incidence of protease mutations in patient samples 
15 that exhibit low, moderate or normal (wild- type) replication fitness. 

The invention further relates to protease mutations that are frequently observed, either 
alone or in combination, in viruses that exhibit reduced replication capacity. 

The invention also relates to the incidence of patient samples with reduced replication 
fitness in which the reduction in fitness is due to altered reverse transcriptase activity. 

20 The invention relates to the occurrence of viruses with reduced replication fitness in 

patients failing antiretroviral drug treatment. 

The invention further relates to a method for using replication fitness measurements 
to guide the treatment of HIV-1, for example, to methods for using replication fitness 
measurements to guide the treatment of patients failing antiretroviral drug treatment or for 
25 using replication fitness measurements to guide the treatment of patients newly infected with 
HIV-1. The methods for using replication fitness measurements to guide the treatment of 
HIV-1 can be adapted to other viruses, including, but not limited to hepadnaviruses (e.g., 
human hepatitis B virus), flaviviruses (e.g., human hepatitis C virus) and herpesviruses (e.g., 
human cytomegalovirus). 

30 In one aspect, the present invention provides a method for determining whether a 

HIV, e.g., HIV-1, has an increased likelihood of having an impaired replication capacity, 
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comprising: detecting whether the reverse transcriptase encoded by said HIV exhibits the 
presence or absence of a mutation associated with impaired replication capacity at amino acid 
position 98, 100, 101, 103, 106, 108, 179, 181, 188, 190, 225 or 236 of the amino acid 
sequence of said reverse transcriptase, wherein the presence of said mutation indicates that 
5 the HIV has an increased likelihood of having impaired replication capacity, with the proviso 
that said mutation is not P236L. 

In another aspect, the present invention provides a method for determining whether a 
HIV, e.g., HIV-1, has an increased likelihood of having an impaired replication capacity, 
comprising detecting whether the reverse transcriptase encoded by said HIV exhibits the 
10 presence or absence of a mutation selected from the group consisting of A98G, L100I, 

K101E, K103N, V106A, V106I, V106M, Y181C, Y188A, Y188C, Y188H, Y188L, G190A, 
G190C, G190E, G190T, G190V, G190Q, G190S and G190V of the amino acid sequence of 
said reverse transcriptase, wherein the presence of said mutation indicates that the HIV has an 
increased likelihood of having impaired replication capacity. 

15 In another aspect, the present invention provides a method for determining whether a 

subject has an HIV, e.g., HIV-1, with an increased likelihood of having an impaired 
replication capacity, comprising detecting whether the reverse transcriptase encoded by said 
HIV exhibits the presence or absence of a mutation associated with impaired replication 
capacity at amino acid position 98, 100, 101, 103, 106, 108, 179, 181, 188, 190, 225 or 236 of 

20 the amino acid sequence of said reverse transcriptase, wherein the presence of said mutation 
indicates that the HIV has an increased likelihood of having impaired replication capacity, 
with the proviso that said mutation is not P236L. 

In another embodiment, said method comprises detecting the presence or absence of a 
mutation associated with impaired replication capacity at at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1 

25 or 12 amino acid positions. In general, the methods can comprise detecting the presence or 
absence of any combinations of mutations listed herein associated with impaired replication 
capacity. For example, the method can comprise detecting the presence or absence of a 
mutation at at least two amino acid positions, such as amino acid positions 106 and 181, 103 
and 190, 103 and 236, 181 and 236, 103 and 188, 103 and 181, 100 and 103, or 98 and 181. 

30 In certain embodiments, such methods can comprise detecting the presence or absence of 

V106A and Y181C; K103N and G109S; P236L and K103N; P236L and Y181C; K103N and 
G190A; K103N and Y181C; K103N and Y188L; L100I and K103N; or Y181C and A98G. 
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Moreover, the method can comprise detecting the presence or absence of a mutation 
at at least three amino acid positions, such as amino acid positions 103, 181 and 236; 100, 
103, and 190; or 103, 181 and 225. In certain embodiments, such methods can comprise 
detecting the presence or absence of P236L, K103N and Y181C; L100I, K103N and G190S; 
5 or Kl 03N, Yl 8 1 C and P225H. 

In another aspect, the present invention provides a method for determining whether a 
subject has an HIV, e.g., HIV-1, with an increased likelihood of having an impaired 
replication capacity, comprising detecting whether the reverse transcriptase encoded by said 
HIV exhibits the presence or absence of a mutation selected from the group consisting of 
10 A98G, L100I, K101E, K103N, V106A, V106I, V106M, Y181C, Y188A, Y188C, Y188H, 
Y188L, G190A, G190C, G190E, G190T, G190V, G190Q, G190S and G190V of the amino 
acid sequence of said reverse transcriptase, wherein the presence of said mutation indicates 
that the HIV has an increased likelihood of having impaired replication capacity. 

in one embodiment, said mutation confers resistance to a non-nucleoside reverse 
15 transcriptase inhibitor ("NNRTI"). In another embodiment, said human immunodeficiency 
virus is human immunodeficiency virus type 1 (HIV-1). In another embodiment, said 
NNRTI is nevirapine ("NVP"), delavirdine ("DLV") or efavirenz ("EFV"). In another 
embodiment, said presence or absence of said mutation in said reverse transcriptase ("RT") is 
detected by hybridization with a sequence-specific oligonucleotide probe to a nucleic acid 
20 sequence of said human immunodeficiency virus encoding said mutation, wherein the 

occurrence of hybridization indicates said presence or absence of said mutation. In another 
embodiment, said presence or absence of said mutation in said RT is detected by determining 
the nucleic acid sequence encoding said mutation. In another embodiment, said presence or 
absence of said mutation in said RT is detected by amplifying the nucleic acid by, e.g., PCR. 
25 In another embodiment, said subject is undergoing or has undergone prior treatment with an 
anti-viral drug. In one embodiment, the anti-viral drug is an NNRTI. 

In another aspect, the present invention provides an isolated oligonucleotide between 
about 1 0 and about 40 nucleotides long encoding a portion of a HIV reverse transcriptase in a 
HIV that comprises a mutation at amino acid position 98, 100, 101, 103, 106, 108, 179, 181, 
30 188, 190, 225 or 236 of an amino acid sequence of said reverse transcriptase in said HIV, 

wherein the mutation is associated with reduced susceptibility to a protease inhibitor, with the 
proviso that said mutation is not P236L. 
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4. BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 present a diagrammatic representation of a replication capacity assay. 

Figure 2 presents a chart summarizing the correlation between the presence of 
5 mutations and pairs of mutations and replication capacity. 

Figure 3 presents charts summarizing the replication capacity of recombinant viruses 
-containing different amino acid substitutions at the same position within reverse transcriptase. 

o 

Figure 4 presents charts demonstrating that replication capacity measurements made 
using the replication capacity assay are consistent with measurements made using a 
1 0 replication competition assay. 

Figure 5 presents charts demonstrating that L74V partially restores the impaired 
replication capacity of recombinant viruses containing G190 mutations. 

Figure 6 presents a chart demonstrating the replication capacity of patient derived 
viruses that contain detrimental G190 mutations. 

1 5 Figure 7 presents a chart showing that the effects of K103N and other NNRTI 

mutations are approximately additive. 

Figure 8 presents charts illustrating the relationship between drug concentration and 
replication capacity for different substitution mutations at the same amino acid position in 
reverse transcriptase. Figure 8A illustrates the relationship between delavirdine and 

20 replication capacity for different substitution mutations at the same amino acid position in 

reverse transcriptase. Figure 8B illustrates the relationship between efavirenz and replication 
capacity for different substitution mutations at the same amino acid position in reverse 
transcriptase. Figure 8C illustrates the relationship between nevirapine and replication 
capacity for different substitution mutations at the same amino acid position in reverse 

25 transcriptase. 

Figure 9 presents charts illustrating the NNRTI susceptibility distribution in the 
absence (white boxes) and presence (grey boxes) of K101P. 

Figure 10 presents charts illustrating the NNRTI susceptibility distribution in the 
absence (white boxes) or presence (grey boxes) of K103R+V179D. 
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DETAILED DESCRIPTION OF THE INVENTION 



The present invention provides methods and compositions for developing and using 
methods and compositions for determining the replication capacity of a non-nucleoside 
reverse transcriptase inhibitor (NNRTI) resistant virus. The methods and compositions are 
5 based on an analysis of a panel of recombinant virus vectors created using site-directed 

mutagenesis containing one or more reverse transcriptase (RT) amino acid substitutions. The 
methods and compositions of the invention significantly improve the quality of life of a 
patient by providing information to the clinician useful for the design of more effective anti- 
viral treatment regimens. Also, by avoiding the administration of ineffective drugs, 
10 considerable time and money is saved. 

5.1 Abbreviations 

" NRTI" is an abbreviation for nucleoside reverse transcriptase inhibitor. 

" NNRTI" is an abbreviation for non nucleoside reverse transcriptase inhibitor. 

" RT" is an abbreviation for reverse transcriptase. 

1 5 " PCR" is an abbreviation for "polymerase chain reaction." 

The amino acid notations used herein for the twenty genetically encoded L-amino 
acids are conventional and are as follows: 



Amino Acid 


One-Letter 


Three Letter 




Abbreviation 


Abbreviation 


Alanine 


A 


Ala 


Arginine 


R 


Arg 


Asparagine 


N 


Asn 


Aspartic acid 


D 


Asp 


Cysteine 


C 


Cys 


Glutamine 


Q 


Gin 


Glutamic acid 


E 


Glu 


Glycine 


G 


Gly 


Histidine 


H 


His 


Isoleucine 


I 


He 


Leucine 


L 


Leu 


Lysine 


K 


Lys 


Methionine 


M 


Met 


Phenylalanine 


F 


Phe 


Proline 


P 


Pro 


Serine 


S 


Ser 


Threonine 


T 


Thr 


Tryptophan 


W 


Trp 



-7- 

CAl: 348070.1 



Amino Acid 



One-Letter 
Abbreviation 

Y 
V 



Three Letter 
Abbreviation 

Tyr 
Val 



Tyrosine 
Valine 



Unless noted otherwise, when polypeptide sequences are presented as a series of one- 
letter and/or three-letter abbreviations, the sequences are presented in the N -> C direction, in 
accordance with common practice. 



- standard-one letter symbol for the amino acid in the sequence, and N is the position in the 
sequence. Mutations are represented herein as AiNA 2 , wherein Ai is the standard one letter 
symbol for the amino acid in the reference protein sequence, A 2 is the standard one letter 
symbol for the amino acid in the mutated protein sequence, and N is the position in the amino 

10 acid sequence. For example, a G25M mutation represents a change from glycine to 

methionine at amino acid position 25. Mutations may also be represented herein as NA 2 , 
wherein N is the position in the amino acid sequence and A 2 is the standard one letter symbol 
for the amino acid in the mutated protein sequence (e.g., 25M, for a change from the wild- 
type amino acid to methionine at amino acid position 25). Additionally, mutations may also 

15 be represented herein as AiN, wherein Ai is the standard one letter symbol for the amino acid 
in the reference protein sequence and N is the position in the amino acid sequence (e.g., G25 
represents a change from glycine to any amino acid at amino acid position 25). This notation 
is typically used when the amino acid in the mutated protein sequence is either not known or, 
if the amino acid in the mutated protein sequence could be any amino acid, except that found 

20 in the reference protein sequence. The amino acid positions are numbered based on the 
full-length sequence of the protein from which the region encompassing the mutation is 
derived. Representations of nucleotides and point mutations in DNA sequences are 
analogous. 



25 comprising specific nucleobase sequences are the conventional one-letter abbreviations. 
Thus, when included in a nucleic acid, the naturally occurring encoding nucleobases are 
abbreviated as follows: adenine (A), guanine (G), cytosine (C), thymine (T) and uracil (U). 
Unless specified otherwise, single-stranded nucleic acid sequences that are represented as a 
series of one-letter abbreviations, and the top strand of double-stranded sequences, are 

30 presented in the 5' -> 3' direction. 



5 



Individual amino acids in a sequence are represented herein as AN, wherein A is the 



The abbreviations used throughout the specification to refer to nucleic acids 
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5.2 Definitions 

As used herein, the following terms shall have the following meanings: 

Unless otherwise specified, " primary mutation" refers to a mutation that affects the 
enzyme active site, i.e. at those amino acid positions that are involved in the enzyme- 
5 substrate complex, or that reproducibly appears in an early round of replication when a virus 
is subject to the selective pressure of an anti-viral agent, or, that has a large effect on 
phenotypic susceptibility to an anti-viral agent. 

" Secondary Mutation" refers to a mutation that is not a primary mutation and that 
contributes to reduced susceptibility or compensates for gross defects imposed by a primary 
10 mutation. 

A " phenotypic assay" is a test that measures the sensitivity of a virus (such as HIV) to 
a specific anti-viral agent. 

A " genotypic assay" is a test that determines a genetic sequence of an organism, a part 
of an organism, a gene or a part of a gene. Such assays are frequently performed in HIV to 
15 establish whether certain mutations are associated with drug resistance are present. 

As used herein, " genotypic data" are data about the genotype of, for example, a virus. 
Examples of genotypic data include, but are not limited to, the nucleotide or amino acid 
sequence of a virus, a part of a virus, a viral gene, a part of a viral gene, or the identity of one 
or more nucleotides or amino acid residues in a viral nucleic acid or protein. 

20 " Susceptibility" refers to a virus' response to a particular drug. A virus that has 

decreased or reduced susceptibility to a drug has an increased resistance or decreased 
sensitivity to the drug. A virus that has increased or enhanced or greater susceptibility to a 
drug has an increased sensitivity or decreased resistance to the drug. 

Phenotypic susceptibility of a virus to a given drug is a continuum. Nonetheless, it is 
25 practically useful to define a threshold or thresholds to simplify interpretation of a particular 
fold-change result. For drugs where sufficient clinical outcome data have been gathered, it is 
possible to define a "clinical cutoff value," as below. 

" Clinical Cutoff Value" refers to a specific point at which resistance begins and 
sensitivity ends. It is defined by the drug susceptibility level at which a patient's probability 
30 of treatment failure with a particular drug significantly increases. The cutoff value is 

different for different anti-viral agents, as determined in clinical studies. Clinical cutoff 
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values are determined in clinical trials by evaluating resistance and outcomes data. Drug 
susceptibility (phenotypic) is measured at treatment initiation. Treatment response, such as 
change in viral load, is monitored at predetermined time points through the course of the 
treatment. The drug susceptibility is correlated with treatment response and the clinical 
5 cutoff value is determined by resistance levels associated with treatment failure (statistical 
analysis of overall trial results). 

"ICn» refers to Inhibitory Concentration. It is the concentration of drug in the patient's 
blood or in vitro needed to suppress the reproduction of a disease-causing microorganism 
(such as HIV) by n %. Thus, " ICsrr refers to the concentration of an anti-viral agent at which 
10 virus replication is inhibited by 50% of the level observed in the absence of the drug. 

"Patient IC50" refers to the drug concentration required to inhibit replication of the virus from 
a patient by 50% and "reference IC50" refers to the drug concentration required to inhibit 
replication of a reference or wild-type virus by 50%. Similarly, " IC90" refers to the 
concentration of an anti-viral agent at which 90% of virus replication is inhibited. 

15 A " fold change" is a numeric comparison of the drug susceptibility of a patient virus 

and a drug-sensitive reference virus. It is the ratio of the Patient IC50 to the drug-sensitive 
reference IC50, i.e., Patient ICso/Reference IC50 = Fold Change ("FC"). A fold change of 1.0 
indicates that the patient virus exhibits the same degree of drug susceptibility as the 
drug-sensitive reference virus. A fold change less than 1 indicates the patient virus is more 

20 sensitive than the drug-sensitive reference virus. A fold change greater than 1 indicates the 
patient virus is less susceptible than the drug-sensitive reference virus. A fold change equal 
to or greater than the clinical cutoff value means the patient virus has a lower probability of 
response to that drug. A fold change less than the clinical cutoff value means the patient 
virus is sensitive to that drug. 

25 A virus has an "increased likelihood of having impaired replication capacity" if the 

virus has a property, for example, a mutation, that is correlated with an impaired replication 
capacity. A property of a virus is correlated with an impaired replication capacity if a 
population of viruses having the property has, on average, an impaired replication capacity 
relative to that of an otherwise similar population of viruses lacking the property. Thus, the 

30 correlation between the presence of the property and impaired replication capacity need not 
be absolute, nor is there a requirement that the property is necessary (i.e., that the property 
plays a causal role in impairing replication capacity) or sufficient (i.e., that the presence of 
the property alone is sufficient) for impairing replication capacity. 
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The term " % sequence homology" is used interchangeably herein with the terms 
" % homology ," " % sequence identity" and " % identity" and refers to the level of amino acid 
sequence identity between two or more peptide sequences, when aligned using a sequence 
alignment program. For example, as used herein, 80% homology means the same thing as 
5 80% sequence identity determined by a defined algorithm, and accordingly a homologue of a 
given sequence has greater than 80% sequence identity over a length of the given sequence. 
Exemplary levels of sequence identity include, but are not limited to, 60, 70, 80, 85, 90, 95, 
98% or more sequence identity to a given sequence. 

Exemplary computer programs which can be used to determine identity between two 
10 sequences include, but are not limited to, the suite of BLAST programs, e.g., BLASTN, 
BLASTX, and TBLASTX, BLASTP and TBLASTN, publicly available on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. See also Altschul et aL, 1990, J. Mol BioL 
215:403-10 (with special reference to the published default setting, i.e., parameters w=4, 
t=17) and Altschul et aL, 1997, Nucleic Acids Res., 25:3389-3402. Sequence searches are 
15 typically carried out using the BLASTP program when evaluating a given amino acid 
sequence relative to amino acid sequences in the GenBank Protein Sequences and other 
public databases. The BLASTX program is preferred for searching nucleic acid sequences 
that have been translated in all reading frames against amino acid sequences in the GenBank 
Protein Sequences and other public databases. Both BLASTP and BLASTX are run using 
20 default parameters of an open gap penalty of 1 1 .0, and an extended gap penalty of 1 .0, and 
utilize the BLOSUM-62 matrix. See Altschul, et al., 1997. 

A preferred alignment of selected sequences in order to determine "% identity" 
between two or more sequences, is performed using for example, the CLUSTAL-W program \ 
in MacVector version 6.5, operated with default parameters, including an open gap penalty of 
25 10.0, an extended gap penalty of 0.1 , and a BLOSUM 30 similarity matrix. 

" Polar Amino Acid" refers to a hydrophilic amino acid having a side chain that is 
uncharged at physiological pH, but which has at least one bond in which the pair of electrons 
shared in common by two atoms is held more closely by one of the atoms. Genetically 
encoded polar amino acids include Asn (N), Gin (Q) Ser (S) and Thr (T). 

30 " Nonpolar Amino Acid" refers to a hydrophobic amino acid having a side chain that 

is uncharged at physiological pH and which has bonds in which the pair of electrons shared in 
common by two atoms is generally held equally by each of the two atoms (i.e., the side chain 
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is not polar). Genetically encoded apolar amino acids include Ala (A), Gly (G), He (I), Leu 
(L), Met (M) and Val (V) . 

" Hydrophilic Amino Acid" refers to an amino acid exhibiting a hydrophobicity of less 
than zero according to the normalized consensus hydrophobicity scale of Eisenberg et al , 
5 1984, J. Mol. Biol. 1 79: 125-142. Genetically encoded hydrophilic amino acids include Arg 
(R), Asn (N), Asp (D), Glu (E), Gin (Q), His (H), Lys (K), Ser (S) and Thr (T). 

" Hydrophobic Amino Acid" refers to an amino acid exhibiting a hydrophobicity of 
greater than zero according to the normalized consensus hydrophobicity scale of Eisenberg et 
al, 1984, J. Mol. Biol. 179:125-142. Genetically encoded hydrophobic amino acids include 
10 Ala (A), Gly (G), He (I), Leu (L), Met (M), Phe (F), Pro (P), Tip (W), Tyr (Y) and Val (V). 

" Acidic Amino Acid" refers to a hydrophilic amino acid having a side chain pK value 
of less than7. Acidic amino acids typically have negatively charged side chains at 
physiological pH due to loss of a hydrogen ion. Genetically encoded acidic amino acids 
include Asp (D) and Glu (E). 

15 " Basic Amino Acid" refers to a hydrophilic amino acid haying a side chain pK value 

of greater than7. Basic amino acids typically have positively charged side chains at 
physiological pH due to association with hydronium ion. Genetically encoded basic amino 
acids include Arg (R), His (H) and Lys (K). 

A " mutation" is a change in an amino acid sequence or in a corresponding nucleic 
20 acid sequence relative to a reference nucleic acid or polypeptide. For embodiments of the 
invention comprising HIV protease or reverse transcriptase, the reference nucleic acid 
encoding protease or reverse transcriptase is the protease or reverse transcriptase coding 
sequence, respectively, present in NL4-3 HIV (GenBank Accession No. AF324493). 
Likewise, the reference protease or reverse transcriptase polypeptide is that encoded by the 
25 NL4-3 HIV sequence. Although the amino acid sequence of a peptide can be determined 

directly by, for example, Edman degradation or mass spectroscopy, more typically, the amino 
sequence of a peptide is inferred from the nucleotide sequence of a nucleic acid that encodes 
the peptide. Any method for determining the sequence of a nucleic acid known in the art can 
be used, for example, Maxam-Gilbert sequencing (Maxam et al, 1980, Methods in 
30 Enzymology 65:499), dideoxy sequencing (Sanger et al, 1977, Proc. Natl Acad. Sci. USA 
74:5463) or hybridization-based approaches {see e.g., Sambrook et al, 2001, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 3 rd ed., NY; and Ausubel et 
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al., 1989, Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 
Interscience, NY). 

A " resistance-associated mutation" ("RAM") in a virus is a mutation correlated with 
reduced susceptibility of the virus to anti-viral agents. A RAM can be found in several 
5 viruses, including, but not limited to a human immunodeficiency virus ("HIV"). Such 
mutations can be found in one or more of the viral proteins, for example, in the protease, 
integrase, envelope or reverse transcriptase of HIV. A RAM is defined relative to a reference 
strain. For embodiments of the invention comprising HIV protease, the reference protease is 
the protease present in NL4-3 HIV (GenBank Accession No. AF324493). 

10 A " mutant" is a virus, gene or protein having a sequence that has one or more changes 

relative to a reference virus, gene or protein. 

The terms " peptide ," " polypeptide" and " protein" are used interchangeably 
throughout. 

The terms " reference" and " wild-type" are used interchangeably throughout. 

1 5 The terms " polynucleotide ," " oligonucleotide" and " nucleic acid" are used 

interchangeably throughout. 

5.3 Mutations Associated with Impaired Replication Capacity 

The present invention provides nucleic acids and polypeptides comprising a mutation 
in the reverse transcriptase of HIV associated with resistance to a NNRTI and with impaired 
20 replication capacity. Preferably, the HIV is human immunodeficiency virus type 1 

("HIV-1"). Examples of NNRTI include, but are not limited to, nevirapine ("NVP"), 
delavirdine ("DLV") and efavirenz ("EFV"). 

In one aspect, the present invention provides peptides, polypeptides or proteins 
comprising a mutation in the reverse transcriptase of HIV associated with resistance to a 
25 NNRTI and with impaired replication capacity. Polypeptides of the invention include 

peptides, polypeptides and proteins that are modified or derived from these polypeptides. In 
one embodiment, the polypeptide comprises post-translational .modifications. 

In one embodiment, the invention provides a polypeptide derived from the HIV 
reverse transcriptase and comprising a mutation at an amino acid position selected from the 
30 group consisting of 98, 100, 101, 103, 106, 108, 179, 181, 188, 190, 225 and 236. In a more 
particularly defined embodiment, the mutation is selected from the group of mutations 
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consisting of A98G, L100I, K101E, K103N, V106A, V106I, V106M, Y181C, Y188A, 
Y188C, Y188H, Y188L, G190A, G190C, G190E, G190T, G190V, G190Q, G190S and 
G190V. 

In another embodiment, the invention provides a polypeptide derived from the HIV 
5 reverse transcriptase and comprising a combination of mutations at two or more amino acid 
positions. Examples of such combinations include, but are not limited to, P236L, K103N and 
Y181C; V106A and Y181C; K103N and G190S; P236L and K103N; L100I, K103N and 
G190S; P236L and Y181C; K103N and G190A; K103N and Y188L; K103N and Y181C; 
L100I and K103N; K103N, Y181C and 225H; Y181C and A98G. 

10 In another aspect, the present invention provides polynucleotides, oligonucleotides or 

nucleic acids encoding or relating to a polypeptide of the invention or a biologically active 
portion thereof, including, for example, nucleic acid molecules sufficient for use as 
hybridization probes, PCR primers or sequencing primers for identifying, analyzing, mutating 
or amplifying the nucleic acids of the invention. 

15 The nucleic acid can be any length. The nucleic acid can be, for example, at least 6, 

7,8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22,23,24,25,26,27,28,29,30,31,32, 
33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 100, 1 10, 120, 125, 150, 
175, 200, 250, 300, 350, 375, 400, 425, 450, 475 or 500 nucleotides in length. The nucleic 
acid can be, for example, less than 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 

20 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 
70, 75, 80, 85, 90, 100, 110, 120, 125, 150, 175, 200, 250, 300, 350, 375, 400, 425, 450, 475, 
500, 525, 550, 575, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 
1500, 1600, 1700, 1800, 1900, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 
7000, 7500, 8000, 8500, 9000, 9500 or 10000 nucleotides in length. In a preferred 

25 embodiment, the nucleic acid has a length and a sequence suitable for detecting a mutation 
described herein, for example, as a probe or a primer. 

In another embodiment, the present invention provides nucleic acid molecules that are 
suitable for use as primers or hybridization probes for the detection of nucleic acid sequences 
of the invention. A nucleic acid molecule of the invention can comprise only a portion of a 
30 nucleic acid sequence encoding a full length polypeptide of the invention for example, a 

fragment that can be used as a probe or primer or a fragment encoding a biologically active 
portion of a polypeptide of the invention. The probe can comprise a labeled group attached 
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thereto, e.g., a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. In 
various embodiments, the nucleic acid molecules of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone. 

5.4 Finding Mutations Associated with Impaired Replication Capacity 

In another aspect, the present invention provides methods for finding mutations 
associated with impaired replication capacity in a virus or a derivative of the virus. 

5.4.1 The Virus and Viral Samples 

An impaired replication capacity-associated mutation according to the present 
invention can be present in any type of virus, for example, any virus found in animals. In one 
embodiment of the invention, the virus includes viruses known to infect mammals, including 
dogs, cats, horses, sheep, cows etc. In a preferred embodiment, the virus is known to infect 
primates. In an even more preferred embodiment the virus is known to infect humans. 
Examples of human viruses include, but are not limited to, human immunodeficiency virus 
("HIV"), herpes simplex virus, cytomegalovirus virus, varicella zoster virus, other human 
herpes viruses, influenza A virus, respiratory syncytial virus, hepatitis A, B arid C viruses, 
rhinovirus, and human papilloma virus. In a preferred embodiment of the invention, the virus 
is HIV. Preferably, the virus is human immunodeficiency virus type 1 ("HIV-l"). The 
foregoing are representative of certain viruses for which there is presently available anti-viral 
chemotherapy and represent the viral families retroviridae, herpesviridae, orthomyxoviridae, 
paramxyxovirus, picornavirus, flavivirus, pneumovirus and hepadnaviridae. This invention 
can be used with other viral infections due to other viruses within these families as well as 
viral infections arising from viruses in other viral families for which there is or there is not a 
currently available therapy. 

An impaired replication capacity-associated mutation according to the present 
invention can be found in a viral sample obtained by any means known in the art for 
obtaining viral samples. Such methods include, but are not limited to, obtaining a viral 
sample from a human or an animal infected with the virus or obtaining a viral sample from a 
viral culture. In one embodiment, the viral sample is obtained from a human individual 
infected with the virus. The viral sample could be obtained from any part of the infected 
individual's body or any secretion expected to contain the virus. Examples of such parts 
include, but are not limited to blood, serum, plasma, sputum, lymphatic fluid, semen, vaginal 
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mucus and samples of other bodily fluids. In a preferred embodiment, the sample is a blood, 
serum or plasma sample. 

In another embodiment, an impaired replication capacity-associated mutation 
according to the present invention is present in a virus that can be obtained from a culture. In 
5 some embodiments, the culture can be obtained from a laboratory. In other embodiments, the 
culture can be obtained from a collection, for example, the American Type Culture 
Collection. 

In certain embodiments, an impaired replication capacity-associated mutation 
according to the present invention is present in a derivative of a virus. In one embodiment, 

10 the derivative of the virus is not itself pathogenic. In another embodiment, the derivative of 
the virus is a plasmid-based system, wherein replication of the plasmid or of a cell transfected 
with the plasmid is affected by the presence or absence of the selective pressure, such that 
mutations are selected that increase resistance to the selective pressure. In some 
embodiments, the derivative of the virus comprises the nucleic acids or proteins of interest, 

15 for example, those nucleic acids or proteins to be targeted by an anti-viral treatment. In one 
embodiment, the genes of interest can be incorporated into a vector. See, e.g., U.S. Patent 
Numbers 5,837,464 and 6,242,1 87 and PCT publication, WO 99/67 '427, each of which is 
incorporated herein by reference. In a preferred embodiment, the genes can be those that 
encode for a protease or reverse transcriptase. 

20 In another embodiment, the intact virus need not be used. Instead, a part of the virus 

incorporated into a vector can be used. Preferably that part of the virus is used that is 
targeted by an anti-viral drug. 

In another embodiment, an impaired replication capacity-associated mutation 
according to the present invention is present in a genetically modified virus. The virus can be 

25 genetically modified using any method known in the art for genetically modifying a virus. 
For example, the virus can be grown for a desired number of generations in a laboratory 
culture. In one embodiment, no selective pressure is applied {i.e., the virus is not subjected to 
a treatment that favors the replication of viruses with certain characteristics), and new 
mutations accumulate through random genetic drift. In another embodiment, a selective 

30 pressure is applied to the virus as it is grown in culture {i.e., the virus is grown under 

conditions that favor the replication of viruses having one or more characteristics). In one 
embodiment, the selective pressure is an anti-viral treatment. Any known anti-viral treatment 
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can be used as the selective pressure. In one embodiment, the virus is HIV and the selective 
pressure is a NNRTL In another embodiment, the virus is HIV-l and the selective pressure is 
a NNRTL Any NNRTL can be used to apply the selective pressure. Examples of NNRTIs 
include, but are not limited to, NVP, DLV and EFV. By treating HIV cultured in vitro with a 
5 NNRTI, one can select for mutant strains of HIV that have an increased resistance to the 

NNRTL The stringency of the selective pressure can be manipulated to increase or decrease 
the survival of viruses not having the selected-for characteristic. 

In another aspect, an impaired replication capacity-associated mutation according to 
the present invention is made by mutagenizing a virus, a viral genome, or a part of a viral 

10 genome. Any method of mutagenesis known in the art can be used for this purpose. In one 
embodiment, the mutagenesis is essentially random. In another embodiment, the essentially 
random mutagenesis is performed by exposing the virus, viral genome or part of the viral 
genome to a mutagenic treatment. In another embodiment, a gene that encodes a viral protein 
that is the target of an anti-viral therapy is mutagenized. Examples of essentially random 

15 mutagenic treatments include, for example, exposure to mutagenic substances {e.g., ethidium 
bromide, ethylmethanesulphonate, ethyl nitroso urea (ENU) etc.) radiation {e.g., ultraviolet 
light), the insertion and/or removal of transposable elements {e.g., Tn5, TnlO), or replication 
in a cell, cell extract, or in vitro replication system that has an increased rate of mutagenesis. 
See, e.g., Russell et al, 1979, Proc. Nat. Acad. Sci. USA 76:5918-5922; Russell, W., 1982, 

20 Environmental Mutagens and Carcinogens: Proceedings of the Third International 

Conference on Environmental Mutagens. One of skill in the art will appreciate that while 
each of these methods of mutagenesis is essentially random, at a molecular level, each has its 
own preferred targets. 

In another aspect, an impaired replication capacity-associated mutation is made using 
25 site-directed mutagenesis. Any method of site-directed mutagenesis known in the art can be 
used {see e.g., Sambrook et al., 2001, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, 3 rd ed., NY; and Ausubel et al, 1989, Current Protocols in 
Molecular Biology, Greene Publishing Associates and Wiley Interscience, NY). The site 
directed mutagenesis can be directed to, e.g., a particular gene or genomic region, a particular 
30 part of a gene or genomic region, or one or a few particular nucleotides within a gene or 
genomic region. In one embodiment, the site directed mutagenesis is directed to a viral 
genomic region, gene, gene fragment, or nucleotide based on one or more criteria. In one 
embodiment, a gene or a portion of a gene is subjected to site-directed mutagenesis because it 
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encodes a protein that is known or suspected to be a target of an anti-viral therapy, e.g. , the 
gene encoding the HIV reverse transcriptase. In another embodiment, a portion of a gene, or 
one or a few nucleotides within a gene, are selected for site-directed mutagenesis. In one 
embodiment, the nucleotides to be mutagenized encode amino acid residues that are known 
5 or suspected to interact with an anti-viral compound. In another embodiment, the nucleotides 
to be mutagenized encode amino acid residues that are known or suspected to be mutated in 
viral strains having an impaired replication capacity. In another embodiment, the 
mutagenized nucleotides encode amino acid residues that are adjacent to or near in the 
primary sequence of the protein residues known or suspected to interact with an anti-viral 

10 compound or known or suspected to be mutated in viral strains having an impaired 

replication capacity. In another embodiment, the mutagenized nucleotides encode amino acid 
residues that are adjacent to or near to in the secondary, tertiary or quaternary structure of the 
protein residues known or suspected to interact with an anti-viral compound or known or 
suspected to be mutated in viral strains having an impaired replication capacity. In another 

15 embodiment, the mutagenized nucleotides encode amino acid residues in or near the active 
site of a protein that is known or suspected to bind to an anti-viral compound. See, e.g., 
Sarkar and Sommer, 1990, Biotechniques, 8:404-407. 

5.4.2 Detecting the Presence or Absence of Mutations in a Virus 

The presence or absence of an impaired replication capacity-associated mutation 
20 according to the present invention in a virus can be detected by any means known in the art 
for detecting a mutation. The mutation can be detected in the viral gene that encodes a 
particular protein, or in the protein itself, i.e., in the amino acid sequence of the protein. 

In one embodiment, the mutation is in the viral genome. Such a mutation can be in, 
for example, a gene encoding a viral protein, in a cis or trans acting regulatory sequence of a 
25 gene encoding a viral protein, an intergenic sequence, or an intron sequence. The mutation 
can affect any aspect of the structure, function, replication or environment of the virus that 
changes its susceptibility to an anti-viral treatment. In one embodiment, the mutation is in a 
gene encoding a viral protein that is the target of an anti-viral treatment. 

A mutation within a viral gene can be detected by utilizing a number of techniques. 
30 Viral DNA or RNA can be used as the starting point for such assay techniques, and may be 
isolated according to standard procedures which are well known to those of skill in the art. 
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The detection of a mutation in specific nucleic acid sequences, such as in a particular 
region of a viral gene, can be accomplished by a variety of methods including, but not limited 
to, restriction-fragment-length-polymorphism detection based on allele-specific restriction- 
endonuclease cleavage (Kan and Dozy, 1978, Lancet ii:910-912), mismatch-repair detection 
5 (Faham and Cox, 1995, Genome Res 5:474-482), binding of MutS protein (Wagner et aL, 
1995, Nucl Acids Res 23:3944-3948), denaturing-gradient gel electrophoresis (Fisher et aL, 
1983, Proc. Natl Acad. Sci. U.S.A. 80:1579-83), single-strand-conformation-polymorphism 
detection (Orita et aL, 1983, Genomics 5:874-879), RNAase cleavage at mismatched base- 
pairs (Myers et aL, 1985, Science 230:1242), chemical (Cotton et aL, 1988, Proc. Natl. Acad. 

10 Sci. U.S.A. 85:4397-4401) or enzymatic (Youil et aL, 1995, Proc. Natl. Acad. Sci. U.S.A. 

92:87-91) cleavage of heteroduplex DNA, methods based on oligonucleotide-specific primer 
extension (Syvanen et aL, 1990, Genomics 8:684-692), genetic bit analysis (Nikiforov et aL, 
1994, Nucl Acids Res 22:4167-4175), oligonucleotide-ligation assay (Landegren et aL, 1988, 
Science 241:1077), oligonucleotide-specific ligation chain reaction ("LCR") (Barrany, 1991, 

15 Proc. Natl. Acad. Sci. U.S.A. 88:189-193), gap-LCR (Abravaya et aL, 1995, Nucl Acids Res 
23:675-682), radioactive or fluorescent DNA sequencing using standard procedures well 
known in the art, and peptide nucleic acid (PNA) assays (Orum et aL, 1993, Nucl. Acids Res. 
21:5332-5356; Thiede et aL, 1996, Nucl. Acids Res. 24:983-984). 

In addition, viral DNA or RNA may be used in hybridization or amplification assays 
20 to detect abnormalities involving gene structure, including point mutations, insertions, 
deletions and genomic rearrangements. Such assays may include, but are not limited to, 
Southern analyses (Southern, 1975, J. Mol. BioL 98:503-517), single stranded conformational 
polymorphism analyses (SSCP) (Orita et aL, 1989, Proc. Natl. Acad. Sci. USA 
86:2766-2770), and PCR analyses (U.S. Patent Nos. 4,683,202; 4,683,195; 4,800,159; and 
25 4,965,188; PCR Strategies, 1995 Innis et al. (eds.), Academic Press, Inc.). 

Such diagnostic methods for the detection of a gene-specific mutation can involve for 
example, contacting and incubating the viral nucleic acids with one or more labeled nucleic 
acid reagents including recombinant DNA molecules, cloned genes or degenerate variants 
thereof, under conditions favorable for the specific annealing of these reagents to their 
30 complementary sequences. Preferably, the lengths of these nucleic acid reagents are at least* 
15 to 30 nucleotides. After incubation, all non-annealed nucleic acids are removed from the 
nucleic acid molecule hybrid. The presence of nucleic acids which have hybridized, if any 
such molecules exist, is then detected. Using such a detection scheme, the nucleic acid from 

-19- 

CAl: 348070.1 



the virus can be immobilized, for example, to a solid support such as a membrane, or a plastic 
surface such as that on a microtiter plate or polystyrene beads. In this case, after incubation, 
non-annealed, labeled nucleic acid reagents of the type described above are easily removed. 
Detection of the remaining, annealed, labeled nucleic acid reagents is accomplished using 
5 standard techniques well-known to those in the art. The gene sequences to which the nucleic 
acid reagents have annealed can be compared to the annealing pattern expected from a 
normal gene sequence in order to determine whether a gene mutation is present. 

Alternative diagnostic methods for the detection of gene specific nucleic acid 
molecules may involve their amplification, e.g., by PCR (U.S. Patent Nos. 4,683,202; 
10 4,683,195; 4,800,159; and 4,965,188; PCR Strategies, 1995 Innis et al. (eds.), Academic 
Press, Inc.), followed by the detection of the amplified molecules using techniques well 
known to those of skill in the art. The resulting amplified sequences can be compared to those 
which would be expected if the nucleic acid being amplified contained only normal copies of 
the respective gene in order to determine whether a gene mutation exists. 

15 Additionally, the nucleic acid can be sequenced by any sequencing method known in 

the art. For example, the viral DNA can be sequenced by the dideoxy method of Sanger et 
al, 1977, Proc. Natl. Acad. Sci. USA 74:5463, as further described by Messing et al, 1981, 
Nuc. Acids Res. 9:309, or by the method of Maxam et al, 1980, Methods in Enzymology 
65:499. See also the techniques described in Sambrook et al, 2001, Molecular Cloning: A 

20 Laboratory Manual, Cold Spring Harbor Laboratory, 3 rd ed., NY; and Ausubel et al, 1989, 
Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 
Interscience, NY. 

Antibodies directed against the viral gene products, i.e., viral proteins or viral peptide 
fragments can also be used to detect mutations in the viral proteins. Alternatively, the viral 

25 protein or peptide fragments of interest can be sequenced by any sequencing method known 
in the art in order to yield the amino acid sequence of the protein of interest. An example of 
such a method is the Edman degradation method which can be used to sequence small 
proteins or polypeptides. Larger proteins can be initially cleaved by chemical or enzymatic 
reagents known in the art, for example, cyanogen bromide, hydroxylamine, trypsin or 

30 chymo trypsin, and then sequenced by the Edman degradation method. 
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5.5 Measuring Phenotypic Susceptibility of a Mutant Virus 

Any method known in the art can be used to determine the phenotypic susceptibility 
of a mutant virus or population of viruses to an anti-viral therapy. See e.g., U.S. Patent Nos. 
5,837,464 and 6,242,187, incorporated herein by reference in their entirities. In some 
5 embodiments a phenotypic analysis is performed, i.e., the susceptibility of the virus to a given 
anti-viral agent is assayed with respect to the susceptibility of a reference virus without the 
mutations. This is a direct, quantitative measure of drug susceptibility and can be performed 
by any method known in the art to determine the susceptibility of a virus to an anti-viral 
agent. An example of such methods includes, but is not limited to, determining the fold 
10 change in IC50 values with respect to a reference virus. Phenotypic testing measures the 

ability of a specific viral strain to grow in vitro in the presence of a drug inhibitor. A virus is 
less susceptible to a particular drug when more of the drug is required to inhibit viral activity, 
versus the amount of drug required to inhibit the reference virus. 

In one embodiment, a phenotypic analysis is performed and used to calculate the IC50 
15 or IC90 of a drug for a viral strain. The results of the analysis can also be presented as fold- 
change in IC50 or IC90 for each viral strain as compared with a drug-susceptible control strain 
or a prior viral strain from the same patient. Because the virus is directly exposed to each of 
the available anti -viral medications, results can be directly linked to treatment response. For 
example, if the patient virus shows resistance to a particular drug, that drug is avoided or 
20 omitted from the patient's treatment regimen, allowing the physician to design a treatment 
plan that is more likely to be effective for a longer period of time. 

In another embodiment, the phenotypic analysis is performed using recombinant virus 
assays ("RVAs"). RVAs use virus stocks generated by homologous recombination between 
viral vectors and viral gene sequences, amplified from the patient virus. In some 
25 embodiments, the viral vector is a HIV vector and the viral gene sequences are protease 
and/or reverse transcriptase sequences. 

In a preferred embodiment, the phenotypic analysis is performed using 
PHENOSENSE™ (ViroLogic Inc., South San Francisco, CA). See Petropoulos et al 9 2000, 
Antimicrob. Agents Chemother. 44:920-928; U.S. Patent Nos. 5,837,464 and 6,242,187. 
30 PHENOSENSE™ is a phenotypic assay that achieves the benefits of phenotypic testing and 
overcomes the drawbacks of previous assays. Because the assay has been automated, 
PHENOSENSE™ offers higher throughput under controlled conditions. The result is an 
assay that accurately defines the susceptibility profile of a patient's HIV isolates to all 
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currently available antiretroviral drugs, and delivers results directly to the physician within 
about 10 to about 15 days of sample receipt. PHENOSENSE™ is accurate and can obtain 
results with only one round of viral replication, thereby avoiding selection of subpopulations 
of virus. The results are quantitative, measuring varying degrees of drug susceptibility, and 
5 sensitive - the test can be performed on blood specimens with a viral load of about 500 

copies/mL and can detect minority populations of some drug-resistant virus at concentrations 
of 10% or less of total viral population. Furthermore, the results are reproducible and can 
vary by less than about 1.4-2.5 fold, depending on the drug, in about 95% of the assays 
performed. 

10 PHENOSENSE™ can be used with nucleic acids from amplified viral gene 

sequences. As discussed in Section 5.4.1, the sample containing the virus may be a sample 
from a human or an animal infected with the virus or a sample from a culture of viral cells. 
In one embodiment, the viral sample comprises a genetically modified laboratory strain. 

A resistance test vector ("RTV") can then be constructed by incorporating the 
15 amplified viral gene sequences into a replication defective viral vector by using any method 
known in the art of incorporating gene sequences into a vector. In one embodiment, 
restrictions enzymes and conventional cloning methods are used. See Sambrook et ah , 2001, 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 3 rd ed., NY; and 
Ausubel et al., 1989, Current Protocols in Molecular Biology, Greene Publishing Associates 
20 and Wiley Interscience, NY. In a preferred embodiment, Apal and Pin AI restriction enzymes 
are used. Preferably, the replication defective viral vector is the indicator gene viral vector 
("IGVV"). In a preferred embodiment, the viral vector contains a means for detecting 
replication of the RTV. Preferably, the viral vector contains a luciferase expression cassette. 

The assay can be performed by first co-transfecting host cells with RTV DNA and a 
25 plasmid that expresses the envelope proteins of another retrovirus, for example, amphotropic 
murine leukemia virus (MLV). Following transfection, virus particles can be harvested and 
used to infect fresh target cells. The completion of a single round of viral replication can be 
detected by the means for detecting replication contained in the vector. In a preferred 
embodiment, the completion of a single round of viral replication results in the production of 
30 luciferase. Serial concentrations of anti-viral agents can be added at either the transfection 
step or the infection step. 
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Susceptibility to the anti-viral agent can be measured by comparing the replication of 
the vector in the presence and absence of the anti-viral agent. For example, susceptibility to 
the anti-viral agent can be measured by comparing the luciferase activity in the presence and 
absence of the anti-viral agent. Susceptible viruses would produce low levels of luciferase 
5 activity in the presence of antiviral agents, whereas viruses with reduced susceptibility would 
produce higher levels of luciferase activity. 

In preferred embodiments, PHENOSENSE™ is used in evaluating the phenotypic 
susceptibility of HIV- 1 to anti -viral drugs. Preferably, the anti -viral drug is aNNRTI. In 
preferred embodiments, the reference viral strain is HIV strain NL4-3 or HXB-2. 

10 In one embodiment, viral nucleic acid, for example, HIV-1 RNA is extracted from 

plasma samples, and a fragment of, or entire viral genes could be amplified by methods such 
as, but not limited to PCR. See, e.g., Hertogs et al. 9 1998, Antimicrob Agents Chemother 
42(2):269-76. In one example, a 2.2-kb fragment containing the entire HIV-1 PR- and 
RT-coding sequence is amplified by nested reverse transcription-PCR. The pool of amplified 

1 5 nucleic acid, for example, the PR-RT-coding sequences, is then cotransfected into a host cell 
such as CD4+ T lymphocytes (MT4) with the pGEMT3 deltaPRT plasmid from which most 
of the PR (codons 10 to 99) and RT (codons 1 to 482) sequences are deleted. Homologous 
recombination leads to the generation of chimeric viruses containing viral coding sequences, 
such as the PR- and RT-coding sequences derived from HIV-1 RNA in plasma. The 

20 susceptibilities of the chimeric viruses to all currently available anti-viral agents targeting the 
products of the transfected genes (proRT and/or PR inhibitors, for example), can be 
determined by any cell viability assay known in the art. For example, an MT4 
cell-3-(4,5-dimethylthiazol-2-yl) -2,5-diphenyltetrazolium bromide-based cell viability assay 
can be used in an automated system that allows high sample throughput. The profile of 

25 resistance to all the anti- viral agents, such as the RT and PR inhibitors can be displayed 
graphically in a single PR-RT- Anti viro gram. 

Other assays for evaluating the phenotypic susceptibility of a virus to anti-viral drugs 
known to one of skill in the art can be used. See, e.g., Shi and Mellors, 1997, Antimicrob 
Agents Chemother. 41(12):2781-85; Gervaix erf al. 9 1997, Proc Natl Acad Sci U. S. A. 
30 94(9):4653-8; Race et al. y 1999, AIDS 13:2061-2068, incorporated herein by reference in 
their entireties. 
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In another embodiment, the susceptibility of a virus to treatment with an anti-viral 
treatment is determined by assaying the activity of the target of the anti-viral treatment in the 
presence of the anti-viral treatment. In one embodiment, the virus is HIV, the anti-viral 
treatment is a protease inhibitor, and the target of the anti -viral treatment is the HIV protease. 
5 See, e.g., U. S. Patent Nos. 5,436,131, 6,103,462, incorporated herein by reference in their 
entireties. 

5.6 Correlating Mutations with Impaired Replication Capacity 

Any method known in the art can be used to determine whether a mutation is 
correlated with an impaired replication capacity. In one embodiment, P values are used to 

10 determine the statistical significance of the correlation, such that the smaller the P value, the 
more significant the measurement. Preferably the P values will be less than 0.05. More 
preferably, P values will be less than 0.01 . P values can be calculated by any means known 
to one of skill in the art. In one embodiment, P values are calculated using Fisher's Exact 
Test. See, e.g., David Freedman, Robert Pisani & Roger Purves, 1980, Statistics, 

1 5 W. W. Norton, New York. 

5.7 Detecting Impaired Replication Capacity 

In another aspect, the present invention provides a method for detecting impaired 
replication capacity in a virus. Impaired replication capacity-associated mutations can be 
identified using any suitable method, as described above. The presence of an impaired 

20 replication capacity-associated mutation in a virus can be detected by any means known in 
the art, e.g., as discussed above. The presence of an impaired replication capacity-associated 
mutation in the virus indicates that the virus has an increased likelihood of having an 
impaired replication capacity. In one embodiment, the virus is human immunodeficiency 
virus (HIV). In another embodiment, the virus is human immunodeficiency virus type-1 

25 (HIV-l). In another embodiment, the anti-viral treatment is a NNRTI. 

In another embodiment, the invention provides a method for determining whether a 
HIV has an increased likelihood of having an impaired replication capacity, comprising 
detecting in the RT of said HIV the presence or absence of a mutation associated with 
impaired replication capacity at amino acid position 98, 100, 101, 103, 106, 108, 179, 181, 
30 188, 190, 225 or 236 of an amino acid sequence of said RT, wherein the presence of said 

mutation indicates that the HIV has an increased likelihood of having an impaired replication 
capacity. In another embodiment, the mutation is not P236L. 
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In another embodiment, the invention provides a method for determining whether a 
HIV has an increased likelihood of having an impaired replication capacity, comprising 
detecting in the RT of said HIV the presence or absence of a mutation associated with 
impaired replication capacity selected from the group consisting of A98G, LI 001, K101E, 
5 K103N, V106A, V106I, V106M, Y181C, Y188A, Y188C, Y188H, Y188L, G190A, G190C, 
G190E, G190T, G190V, G190Q, G190S and G190V, wherein the presence of said mutation 
indicates that the HIV has an increased likelihood of having an impaired replication capacity. 

In another embodiment, the invention provides a method for determining whether a 
HIV from a subject has an increased likelihood of having an impaired replication capacity, 
10 comprising detecting in the RT of said HIV the presence or absence of a mutation associated 
with impaired replication capacity at amino acid position 98, 100, 101, 103, 106, 108, 179, 
181, 188, 190, 225 or 236 of an amino acid sequence of said RT, wherein the presence of said 
mutation indicates that the HIV has an increased likelihood of having an impaired replication 
capacity. In another embodiment, the mutation is not P236L. 

15 In another embodiment, the invention provides a method for determining whether a 

HIV from a subject has an increased likelihood of having an impaired replication capacity, 
comprising detecting in the RT of said HIV the presence or absence of a mutation associated 
with impaired replication capacity selected from the group consisting of A98G, LI 001, 
K101E, K103N, V106A, V106I, V106M, Y181C, Y188A, Y188C, Y188H, Y188L, G190A, 

20 G190C, G190E, G190T, G190V, G190Q, G190S and G190V, wherein the presence of said 
mutation indicates that the HIV has an increased likelihood of having an impaired replication 
capacity. 

6. EXAMPLES 

6.1 Example 1: Measuring Replication Fitness Using Resistance Test Vectors 

25 This example provides methods and compositions for accurately and reproducibly 

measuring the replication fitness of HIV- 1 . The methods for measuring replication fitness 
can be adapted to other viruses, including, but not limited to hepadnaviruses (e.g., human 
hepatitis B virus), flavi viruses (e.g., human hepatitis C virus) and herpesviruses (e.g., human 
cytomegalovirus). This example further provides a method for measuring the replication 

30 fitness of HIV- 1 that exhibits reduced drug susceptibility to reverse transcriptase inhibitors 
and protease inhibitors. The methods for measuring replication fitness can be adapted to 
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other classes of inhibitors of HIV- 1 replication, including, but not limited to integration, virus 
assembly, and virus attachment and entry. 

Replication fitness tests are carried out using the methods for phenotypic drug 
susceptibility and resistance tests described in US Patent Number 5,837,464 (International 
5 Publication Number WO 97/273 19) which is hereby incorporated by reference in its entirety. 

Patient-derived segment(s) corresponding to the HIV protease and reverse 
transcriptase coding regions were either patient-derived segments amplified by the reverse 
transcription-polymerase chain reaction method (RT-PCR) using viral RNA isolated from 
viral particles present in the serum of HIV-infected individuals or were mutants of wild type 

10 HIV-1 made by site directed mutagenesis of a parental clone of resistance test vector DNA. 
Resistance test vectors are also referred to as "fitness test vectors" when used to evaluate 
replication fitness. Isolation of viral RNA was performed using standard procedures (e.g. 
RNAGENTS™ Total RNA Isolation System, Promega, Madison WI or RNAzol, Tel-Test, 
Friendswood, TX). The RT-PCR protocol was divided into two steps. A retroviral reverse 

15 transcriptase (e.g. Moloney MuLV reverse transcriptase (Roche Molecular Systems, Inc., 
Branchburg, NJ), or avian myeloblastosis virus (AMV) reverse transcriptase, (Boehringer 
Mannheim, Indianapolis, IN)) was used to copy viral RNA into cDNA. The cDNA was then 
amplified using a thermostable DNA polymerase (e.g. Tag (Roche Molecular Systems,Inc, 
Branchburg, NJ), Tth (Roche Molecular Systems, Inc., Branchburg, NJ), PRIMEZYME™ 

20 (isolated from Thermus brockianus, Biometra, Gottingen, Germany)) or a combination of 
thermostable polymerases as described for the performance of "long PCR" (Barnes, W.M., 
1994, Proc. Natl. Acad. Sci, USA 91, 2216-20) (e.g. Expand High Fidelity PCR System (Taq 
+ Pwo), (Boehringer Mannheim. Indianapolis, IN) OR GENEAMP XL™ PCR kit (Tth + 
Vent), (Roche Molecular Systems, Inc., Branchburg, NJ)). 

25 PCR primers were designed to introduce Apal and Agel recognition sites into both 

ends of the PCR product, respectively. 

Fitness test vectors incorporating the "test" patient-derived segments were constructed 
as described in US Patent Number 5,837,464 (see Figure 1) using an amplified DNA product 
of 1.5 kB prepared by RT-PCR using viral RNA as a template and oligonucleotides PCR6 
30 (#1), PDSApa (#2) and PDSAge (#3) as primers, followed by digestion with Apal and Agel 
or the isoschizomer PinAl . To ensure that the plasmid DNA corresponding to the resultant 
fitness test vector comprises a representative sample of the HIV viral quasi-species present in 
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the serum of a given patient, many (>100) independent E. coli transformants obtained in the 
construction of a given fitness test vector were pooled and used for the preparation of plasmid 
DNA. 

A packaging expression vector encoding an amphotrophic MuLV 4070A env gene 
5 product enables production in a fitness test vector host cell of fitness test vector viral particles 
which can efficiently infect human target cells. Fitness test vectors encoding all HIV genes 
with the exception of env were used to transfect a packaging host cell (once transfected the 
host cell is referred to as a fitness test vector host cell). The packaging expression vector 
which encodes the amphotrophic MuLV 4070A env gene product is used with the resistance 
10 test vector to enable production in the fitness test vector host cell of infectious pseudo typed 
fitness test vector viral particles. 

Fitness tests performed with fitness test vectors were carried out using packaging host 
and target host cells consisting of the human embryonic kidney cell line 293. 

Fitness tests were carried out with fitness test vectors using two host cell types. 
15 Fitness test vector viral particles were produced by a first host cell (the fitness test vector host 
cell) that was prepared by transfecting a packaging host cell with the fitness test vector and 
the packaging expression vector. The fitness test vector viral particles were then used to 
infect a second host cell (the target host cell) in which the expression of the indicator gene is 
measured (see Figure 1). 

20 The fitness test vectors containing a functional luciferase gene cassette were 

constructed and host cells were transfected with the fitness test vector DNA. The fitness test 
vectors contained patient-derived reverse transcriptase and protease DNA sequences that 
encode proteins which were either susceptible or resistant to the antiretroviral agents, such as 
nucleoside reverse transcriptase inhibitors, non-nucleoside reverse transcriptase inhibitors 

25 and protease inhibitors. 

The amount of luciferase activity detected in the infected cells is used as a direct 
measure of "infectivity," "replication capacity" or "replication fitness," i.e., the ability of the 
virus to complete a single round of replication. Relative fitness is assessed by comparing the 
amount of luciferase activity produced by patient derived viruses to the amount of luciferase 
30 activity produced by a well-characterized reference virus (wildtype) derived from a molecular 
clone of HIV-1, for example NL4-3 or HXB2. Fitness measurements are expressed as a 
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percent of the reference, for example 25%, 50%, 75%, 100% or 125% of reference (Figures 
2, 3, 5, 6 and 7). 

Host cells were seeded in 10-cm-diameter dishes and were transfected one day after 
plating with fitness test vector plasmid DNA and the envelope expression vector. 
5 Transfections were performed using a calcium-phosphate co-precipitation procedure. The 
cell culture media containing the DNA precipitate was replaced with fresh medium, from one 
to 24 hours, after transfection. Cell culture medium containing fitness test vector viral 
particles was harvested one to four days after transfection and was passed through a 0.45-mm 
filter before being stored at -80 °C. HIV capsid protein (p24) levels in the harvested cell 

10 culture media were determined by an EIA method as described by the manufacturer (SIAC; 
Frederick, MD). Before infection, target cells (293 and 293/T) were plated in cell culture 
media. Control infections were performed using cell culture media from mock transfections 
(no DNA) or transfections containing the fitness test vector plasmid DNA without the 
envelope expression plasmid. One to three or more days after infection the media was 

15 removed and cell lysis buffer (Promega) was added to each well. Cell lysates were assayed 
for luciferase activity. Alternatively, cells were lysed and luciferase was measured by adding 
Steady-Glo (Promega) reagent directly to each well without aspirating the culture media from 
the well. 

6.2 Example 2: Measuring Replication Fitness of Viruses 
20 withDeficiencies in RT Activity 

This example provides methods and compositions for identifying mutations in reverse 

transcriptase that alter replication fitness. The methods for identifying mutations that alter 

replication fitness can be adapted to other components of HIV- 1 replication, including, but 

not limited to, integration, virus assembly, and virus attachment and entry. This example also 

25 provides a method for quantifying the affect that specific mutations reverse transcriptase have 

on replication fitness. Means and methods for quantifying the effect that specific protease 

and reverse transcriptase mutations have on replication fitness can be adapted to mutations in 

other viral genes involved in HIV-1 replication, including, but not limited to the gag, pol, and 

envelope genes. 

30 Fitness test vectors were constructed as described in Example 1 . Fitness test vectors 

derived from patient samples or clones derived from the fitness test vector pools, or fitness 
test vectors engineered by site directed mutagenesis to contain specific mutations, were tested 
in a fitness assay to determine accurately and quantitatively the relative fitness compared to a 
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well-characterized reference standard. A patient sample was further examined for increased 
or decreased reverse transcriptase activity correlated with the relative fitness observed. 

Reverse transcriptase activity of patient HIV samples : Reverse transcriptase activity 
can be measured by any number of widely used assay procedures, including but not limited to 
5 homopolymeric extension using {e.g. oligo dTrpoly rC) or real time PCR based on molecular 
beacons or 5 'exonuclease activity (Lie and Petropoulos, 1998, Curr Opin BiotechnoL 
9(l):43-48). In one embodiment of the invention, the fitness of the patient virus was 
compared to a reference virus to determine the relative fitness compared to "wildtype" 
viruses that have not been exposed to reverse transcriptase inhibitor drugs. In another 
10 embodiment, the fitness of the patient virus is compared to viruses collected from the same 
patient at different timepoints, for example prior to initiating therapy, before or after changes 
in drug treatment, or before or after changes in virologic (RNA copy number), immunologic 
(CD4 T-cells), or clinical (opportunistic infection) markers of disease progression. 

Genotvpic analysis of patient HIV samples : Fitness test vector DNAs, either pools or 
15 clones, can be analyzed by any genotyping method, e.g., as described above. In one 

embodiment of the invention, patient HIV sample sequences were determined using viral 
RNA purification, RT/PCR and ABI chain terminator automated sequencing. The sequence 
that was determined is compared to reference sequences present in the database or is 
compared to a sample from the patient prior to initiation of therapy, if available. The 
20 genotype was examined for sequences that were different from the reference or pre-treatment 
sequence and correlated to the observed fitness. 

Fitness analysis of site directed mutants : Genotypic changes that are observed to 
correlate with changes in fitness were evaluated by construction of fitness vectors containing 
the specific mutation on a defined, wild-type (drug susceptible) genetic background. 

25 Mutations were incorporated alone and/or in combination with other mutations that were 
thought to modulate the fitness of a virus. Mutations were introduced into the fitness test 
vector through any of the widely known methods for site-directed mutagenesis. In one 
embodiment of this invention the mega-primer PCR method for site-directed mutagenesis 
was used. A fitness test vector containing the specific mutation or group of mutations were 

30 then tested using the fitness assay described in Example 1 and the fitness was compared to 
that of a genetically defined wild-type (drug susceptible) fitness test vector that lacked the 
specific mutations. Observed changes in fitness can be attributed to the specific mutations 
introduced into the resistance test vector. 
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In several related embodiments of the invention, fitness test vectors containing site 
directed mutations in reverse transcriptase that resulted in amino acid substitutions at position 
98 (A98G), 100 (LI 001), 101 (K101E), 103 (K103N), 106 (V106A, V106I, V106M), 
181 (Y181C, Y188A, Y188C, Y188H, Y188L), or 190 (G190A, G190S, G190C, G190E, 
5 G190V, G190T, G190Q) and that displayed different amounts of reverse transcriptase 
activity were constructed and tested for fitness (Figures 2, 3, 4, 5, 6, 7 and 8A-8C). The 
fitness results established a correlation between specific reverse transcriptase amino acid 
substitutions and fitness. The data demonstrated that different mutations at the same position 
can have drastically different effects on replication capacity. Figure 3 shows that VI 061 and 
10 V106M had relatively high replication capacities compared to VI 06 A. The differences 

observed between different mutations at position 190 were particularly striking, ranging from 
barely detectable to greater than 80% of wild-type. These differences in replication capacity 
also were manifest when the mutant viral strains are replicated in the presence of NNRTIs, as 
shown in Figure 8A-8C. 

15 Figure 4 demonstrates that the replication capacity measurements performed as 

described above produced results that were consistent with measurements made using 
replication competition assay results. 

Viruses containing multiple mutations also were constructed and tested as shown in 
Figure 2. Figure 7 compares the replication capacities of single mutant strains and double- 
20 mutant strains comprising combinations of the single mutations and shows that strains having 
combinations of mutations had replication capacities less than strains having either mutation 
alone. 

Mutant strains comprising both a mutation at position 190 and the mutation L74V 
were also constructed as shown in Figure 5. While the L74V mutation did not affect the 
25 replication capacity of a strain that was wild-type at position 190, it did increase the 
replication capacity of most other position 190 mutations. 

6.3 Example 3: Analysis of Patient Samples to Identify 
Resistance- Associated Mutations . 

This example demonstrates a method of analyzing patient samples so as to identify 

30 mutations that are associated with NNRTI resistance. It also demonstrates that K101P, 

K103R and V179D as well as the combination of K103R and V179D are new 

NNRTI-resistance mutations. 
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In order to determine the relationship between an HIV-1 strain's reverse transcriptase 
(RT) sequence and its susceptibility to NNRTIs, a data set of 18,034 samples was analyzed 
genotypically as well as phenotypically. Those samples with NNRTI resistance in the 
absence of well-characterized NNRTI mutations were identified by univariate and 
5 combinatorial correlation analysis. 8,673 samples had no mutation at position 100, 181, 188, 
190 or 227, nor any of the following mutations in RT: A98G, K101E, K103N/S, V106A/M, 
P225H, M230L, or P236L, but still had high-level reductions in susceptibility to one or more 
NNRTIs. Of the 8,673 samples, 146 samples exhibited > 5-fold reduction in susceptibility to 
at least one NNRTI. 

10 All amino acid positions of RT where at least two samples had a mutation in an 

unmixed form no mixture of mutation and wild type) were identified. The amino acid 
positions and residues identified were: 

P4, E6, K20, T27, V35, T39, M41, K43, E44, S48, K49, 150, V60, A62, K64, D67, S68, T69, 
K70, L74, V75, F77, R83, D86, V90, 194, A98, K101, Q102, K103, K104, V106, V108, 
15 Y115, Fl 16, V118, D121, K122, D123, 1135, E138, T139, 1142, Q151, A158, C162, T165, 
Kl 66, 1167, E169, K173, Q174, P176, D177, 1178, V179, M184, V189, E194, G196, Q197, 
T200, 1202, E203, Q207, H208, L210, R211, F214, T215, D218, K219, H221, L228, D237, 
K238, P243, V245, E248, D250, 1257, A272, K275, V276, R277, Q278, K281, L283, R284, 
T286, A288, E291, V292, V293, P294 and E297. 

20 The amino acids present at each of the above positions were then determined and any 

amino acid present in more than one sample was analyzed separately. Correlation analyses 
were performed using Statview software (SAS Institute, Cary, NC, USA) to determine amino 
acid mutations correlated with each NNRTI continuous fold change (FC), or with a 
dichotomous fold change (FC >10). 

25 The data (RT mutations, FC for nevirapine (NVP), delavirdine (DLV) and efavirenz 

(EFV)) for the samples that were most resistant to NNRTIs from amongst the 146 samples 
analyzed are shown in Table 1 . Table 1 provides the RT genotypes and NNRTI FC in 
susceptibility observed in samples that exhibited a FC greater than ten with at least one 
NNRTI. The maximum FC values observed for NVP, DLV and EFV were > 400-, > 250- 

30 and > 144-fold, respectively. As illustrated in Figures 9 and 10 and Table 1, samples with 

the highest FC contained a proline substitution at position 101 (n=10, median NVP, DLV and 
EFV fold change was 355-, 26- and 26-fold, respectively) or a combination of 103R and 
179D (n=13, median NVP, DLV and EFV fold change was 12-, 24- and 17-fold, 
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respectively). The presence of K101P, K103R and V179D in the RT genotypes are indicated 
in bold in column 1 of Table 1 . 

All of the 10 IP samples, and all but 2 of the 103R/179D samples, also had mutations 
associated with nucleoside RT inhibitor (NRTI) resistance. In samples without 101P or 
5 103R/179D, the maximum FC for NVP, DLV and EFV was 41-, 67- and 15-fold, 
respectively. Mutations associated with reduced susceptibility to all three NNRTIs 
independent of 101P, 103R/179D, or NRTI-associated mutations, included 101Q, 1061, 135T, 
166R, 179D, 1891, 245T, 272S, and 297T. In some samples, surprising correlations with 
decreased resistance or increased sensitivity, and possibly hypersusceptibility to NNRTIs was , 
10 also seen at positions 245 and 138. 

Correlation of K101P, K103R/V179D and other mutations identified as NNRTI- 
resistance mutations to a decreased susceptibility to NNRTIs was verified by analyzing 
samples without NNRTI resistance. A total of 526 samples without NNRTI resistance were 
analyzed. The NNRTI fold change of these samples were all between 1 and 2, indicating no 
15 NNRTI resistance. Genotypic analysis of the 526 samples showed that none of the mutations 
identified as NNRTI-resi stance mutations were present in any of the samples, confirming that 
the NNRTI-resistance mutations were indeed associated with a decreased susceptibility to 
NNRTIs. 

This example demonstrates a method for determining whether a human 
20 immunodeficiency virus (HIV), e.g., human immunodeficiency virus type 1 (HIV-1), has an 
increased likelihood of having resistance to treatment with a NNRTI, comprising: detecting 
whether the reverse transcriptase (RT) encoded by said HIV-1 exhibits the presence or 
absence of a mutation associated with resistance to treatment with said NNRTI at amino acid 
position 101, 103 or 179 of an amino acid sequence of said RT, e.g., K101P, K103R or 
25 V179D, wherein the presence of said mutation indicates that the HIV-1 has an increased 
likelihood of having resistance to treatment with the NNRTI. In general, the methods can 
comprise detecting the presence or absence of any combinations of mutations listed herein 
associated with NNRTI resistance. For example, the method can comprise detecting the 
presence or absence of a mutation at 2 or all 3 amino acid positions associated with NNRTI 
30 resistance. In certain embodiments, such a method can comprise detecting the presence or 
absence of K103R and V179D. Examples of NNRTIs include, but are not limited to, 
delavirdine, nevirapine and efavirenz. 
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All references cited herein are incorporated by reference in their entireties. 

The examples provided herein, both actual and prophetic, are merely embodiments of 
the present invention and are not intended to limit the invention in any way. 
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